ENDOTHELIAL PAS DOMAIN PROTEIN



The research carried out in the subject application was supported in part by grants
from the National Institutes of Health. The government may have rights in any patent issuing
on this application.

INTRODUCTION 

Field of the Invention 

The field of this invention is transcription factor proteins involved in vascularization.

Background 

Roughly a dozen proteins classified as basic helix-loop-helix/PAS domain
transcription factors have been described in both vertebrates and invertebrates. Members of
this class derive their name from the shared presence of a basic helix-loop-helix (bHLH)
motif that specifies sequence dependent recognition of DNA and a PAS domain composed of
two imperfect repeats. PAS is an acronym derived from the first three proteins observed to
contain this motif. These include the product of the period gene of Drosophila melanogaster
(Jackson et al. 1986; Citri et al. 1987), the aryl hydrocarbon nuclear transporter gene (ARNT)
of mammals (Burbach et al. 1992), and the product of the fruit fly single-minded gene
(Nambu et al. 1991).

The imperfect, direct repeats within the PAS domain are approximately 50 amino
acids in length and contain a signature His-X-X-Asp sequence in each repeat. Three
biochemical functions have been assigned to the PAS domain. First, it acts in concert with
the helix-loop-helix domain of bHLH/PAS proteins to form a dimerization surface
(Reisz-Porszasz et al. 1994; Fukunaga et al. 1995; Lindebro et al. 1995). In the case of the period
gene product, which lacks a bHLH domain, the PAS domain specifies heterodimerization
with the product of the timeless locus (Gekakis et al. 1995; Myers et al. 1995). Interaction
between the period and timeless gene products represents a crucial event in the control of
circadian rhythm in fruit flies (Hunter-Ensor et al. 1996; Lee et al. 1996; Myers et al. 1996;
Zeng et al. 1996). In contrast, the aryl hydrocarbon receptor (AHR) heterodimerizes with
ARNT via PAS domain interactions (Fukunaga et al. 1995), producing a heterodimer that is
competent for nuclear gene interaction. Second, the PAS domain mediates interaction with
heat shock protein 90 (HSP-90). Several PAS domain proteins, including the single-minded
gene product and the AHR, can be sequestered in the cytoplasm in an inactive state.
Maintenance of the inactive state involves interactions between the PAS domain and HSP-90
(Perdew, 1988; Chen and Perdew, 1994; Henry and Gasiewicz, 1993; McGuire et al. 1995).
Finally, the PAS domain of the AHR facilitates high affinity binding of certain xenobiotic
compounds including dioxin (reviewed in Hankinson, 1995; Schmidt and Bradfield, 1996).

PAS domain transcription factors perform diverse functions in a variety of cell types
and organisms. The period gene product helps regulate circadian rhythm in fruit flies
(Konopka and Benzer, 1971), whereas the mammalian AHR provides response to xenobiotics
by activating genes whose products facilitate detoxification (Schmidt and Bradfield, 1996). A
more recently described member of the PAS domain family, hypoxia inducible factor
(HIF-1α), activates genes whose products regulate hematopoiesis in response to oxygen deprivation
(Wang et al. 1995). In Drosophila, the single-minded gene product affects neurogenesis
(Nambu et al. 1991) and the trachealess gene product controls the formation of tubular
structures in the embryo (Wilk et al. 1996; Isaac and Andrew, 1996).

The utilization of bHLH/PAS domain proteins in diverse species and physiological
processes raises the possibility that this family of transcription factors might consist of many
undiscovered members. Here we report the initial characterization of new members of this
protein family collectively designated endothelial PAS domain protein 1 (EPAS1).

SUMMARY OF THE INVENTION 
The invention provides methods and compositions relating to endothelial PAS domain
protein 1 (EPAS1), related nucleic acids, and protein domains thereof having EPAS1-specific
activity. EPAS1 proteins can regulate specification of endothelial tissue, such as vasculature,
the blood brain barrier, etc. The proteins may be produced recombinantly from transformed
host cells from the subject EPAS1 encoding nucleic acids or purified from mammalian cells.
The invention provides isolated EPAS1 hybridization probes and primers capable of
specifically hybridizing with the disclosed EPAS1 gene, EPAS1-specific binding agents such
as specific antibodies, and methods of making and using the subject compositions in
diagnosis (e.g. genetic hybridization screens for EPAS1 transcripts), therapy (e.g. gene
therapy to modulate EPAS1 gene expression) and in the biopharmaceutical industry (e.g. as
immunogens, reagents for isolating B-cell specific activators or other transcriptional
regulators, reagents for screening chemical libraries for lead pharmacological agents, etc.).

SEQ ID NO: LISTING
SEQ ID NO:1: human EPAS1 cDNA.
SEQ ID NO:2: murine EPAS1 cDNA.
SEQ ID NO:3: HIF-1α binding site.
SEQ ID NO:4: human EPAS1 protein.
SEQ ID NO:5: murine EPAS1 protein.
SEQ ID NO:6: human HIF-1α protein.
SEQ ID NO:7: murine HIF-1α protein.

DETAILED DESCRIPTION OF THE INVENTION 
The nucleotide sequences of natural cDNAs encoding human and murine EPAS1
proteins are shown as SEQ ID NOS:1 and 2, respectively, and the full conceptual translates as
SEQ ID NOS:4 and 5, respectively. The EPAS1 proteins of the invention include incomplete
translates of SEQ ID NOS:1 and 2 and deletion mutants of SEQ ID NOS:4 and 5, which
translates and deletion mutants have EPAS1-specific amino acid sequence and binding
specificity or function. Such active EPAS1 deletion mutants, EPAS1 peptides or protein
domains comprise at least 14, preferably at least about 16, more preferably at least about 20
consecutive residues of SEQ ID NO:4 or 5. For example, EPAS1 protein domains identified
below are shown to provide dimerization, protein-binding, and nucleic acid binding function.
Additional such domains are identified in and find use, inter alia, in solid-phase binding
assays as described below.

EPAS1-specific activity or function may be determined by convenient in vitro, cell-based,
or in vivo assays: e.g. in vitro binding assays, cell culture assays, in animals (e.g.
immune response, gene therapy, transgenics, etc.), etc. Binding assays encompass any assay
where the molecular interaction of an EPAS1 protein with a binding target is evaluated. The
binding target may be a natural intracellular binding target such as another bHLH/PAS
protein, a heat shock protein, or a nucleic acid sequence/binding site or other regulator that
directly modulates EPAS1 activity or its localization; or a non-natural binding target such as a
specific immune protein such as an antibody, or an EPAS1-specific agent such as those
identified in screening assays such as described below. EPAS1-binding specificity may be
assayed by binding equilibrium constants (usually at least about 10⁷ M⁻¹, preferably at least
about 10⁸ M⁻¹, more preferably at least about 10⁹ M⁻¹), by the ability of the subject protein to
function as negative mutants in EPAS1-expressing cells, to elicit EPAS1-specific antibody in
a heterologous host (e.g. a rodent or rabbit), etc. In any event, the EPAS1 binding specificity
of the subject EPAS1 proteins necessarily distinguishes HIF-1α.
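
The binding equilibrium constants above are association constants (units of M⁻¹); many assay readouts are instead reported as dissociation constants, which are simply the reciprocal. The following is a minimal sketch of that conversion, assuming the three thresholds are 10⁷, 10⁸ and 10⁹ M⁻¹ (the middle value is read from a damaged exponent in the source text). The function names and tier labels are hypothetical and are not part of the disclosure.

```python
def ka_to_kd(ka_per_molar: float) -> float:
    """Return the dissociation constant Kd (in M) for an association constant Ka (in M^-1)."""
    if ka_per_molar <= 0:
        raise ValueError("association constant must be positive")
    return 1.0 / ka_per_molar


def affinity_tier(ka_per_molar: float) -> str:
    """Place a measured Ka into the three tiers named in the text (labels are illustrative)."""
    if ka_per_molar >= 1e9:
        return "more preferred"
    if ka_per_molar >= 1e8:  # assumed middle threshold
        return "preferred"
    if ka_per_molar >= 1e7:
        return "usual"
    return "below stated range"


if __name__ == "__main__":
    # Print the Kd that corresponds to each stated Ka threshold.
    for ka in (1e7, 1e8, 1e9):
        print(f"Ka = {ka:.0e} M^-1 -> Kd = {ka_to_kd(ka):.0e} M ({affinity_tier(ka)})")
```

Under these assumptions the three tiers correspond to dissociation constants of roughly 100 nM, 10 nM and 1 nM or tighter.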

The claimed EPAS1 proteins are isolated or pure: an "isolated" protein is 
unaccompanied by at least some of the material with which it is associated in its natural state, 
preferably constituting at least about 0.5%, and more preferably at least about 5% by weight 
of the total protein in a given sample and a pure protein constitutes at least about 90%, and 
preferably at least about 99% by weight of the total protein in a given sample. The EPAS1 
proteins and protein domains may be synthesized, produced by recombinant technology, or 
purified from mammalian, preferably human cells. A wide variety of molecular and 
biochemical methods are available for biochemical synthesis, molecular expression and 
purification of the subject compositions, see e.g. Molecular Cloning, A Laboratory Manual 
(Sambrook, et al. Cold Spring Harbor Laboratory), Current Protocols in Molecular Biology 
(Eds. Ausubel, et al., Greene Publ. Assoc., Wiley-Interscience, NY) or that are otherwise 
known in the art. 

The invention provides natural and non-natural EPAS1-specific binding agents,
methods of identifying and making such agents, and their use in diagnosis, therapy and
pharmaceutical development. For example, EPAS1-specific agents are useful in a variety of
diagnostic and therapeutic applications. Novel EPAS1-specific binding agents include
EPAS1-specific receptors, such as somatically recombined protein receptors like specific
antibodies or T-cell antigen receptors (see, e.g. Harlow and Lane (1988) Antibodies, A
Laboratory Manual, Cold Spring Harbor Laboratory) and other natural intracellular binding
agents identified with assays such as one-, two- and three-hybrid screens, non-natural
intracellular binding agents identified in screens of chemical libraries such as described
below, etc. For diagnostic uses, the binding agents are frequently labeled, such as with
fluorescent, radioactive, chemiluminescent, or other easily detectable molecules, either
conjugated directly to the binding agent or conjugated to a probe specific for the binding
agent. Agents of particular interest modulate EPAS1 function, e.g. EPAS1-dependent
transcriptional activation; for example, isolated cells, whole tissues, or individuals may be
treated with an EPAS1 binding agent to activate, inhibit, or alter EPAS1-dependent
transcriptional processes.

The amino acid sequences of the disclosed EPAS1 proteins are used to back-translate
EPAS1 protein-encoding nucleic acids optimized for selected expression systems (Holler et
al. (1993) Gene 136, 323-328; Martin et al. (1995) Gene 154, 150-166) or used to generate
degenerate oligonucleotide primers and probes for use in the isolation of natural
EPAS1-encoding nucleic acid sequences ("GCG" software, Genetics Computer Group, Inc, Madison
WI). EPAS1-encoding nucleic acids are used in EPAS1-expression vectors and incorporated into
recombinant host cells, e.g. for expression and screening, transgenic animals, e.g. for
functional studies such as the efficacy of candidate drugs for disease associated with
EPAS1-modulated transcription, etc.
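
When degenerate oligonucleotide primers are designed from an amino acid sequence, a common first step is to look for a stretch of residues that can be encoded by few distinct nucleotide sequences. The sketch below illustrates that idea using codon counts from the standard genetic code. It is not the GCG software cited above; the function names, the default window size, and the test peptide are hypothetical.

```python
# Number of codons encoding each amino acid in the standard genetic code.
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4, "H": 2, "I": 3,
    "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def primer_degeneracy(peptide: str) -> int:
    """Count the distinct nucleotide sequences that could encode the peptide."""
    total = 1
    for residue in peptide.upper():
        total *= CODONS_PER_RESIDUE[residue]
    return total


def least_degenerate_window(protein: str, window: int = 6):
    """Return (start index, peptide, degeneracy) for the least degenerate window.

    Ties are resolved in favor of the earliest window.
    """
    if window < 1 or len(protein) < window:
        raise ValueError("window must be between 1 and the protein length")
    best = None
    for start in range(len(protein) - window + 1):
        peptide = protein[start:start + window]
        degeneracy = primer_degeneracy(peptide)
        if best is None or degeneracy < best[2]:
            best = (start, peptide, degeneracy)
    return best


if __name__ == "__main__":
    # Hypothetical peptide, not taken from SEQ ID NO:4 or 5.
    print(least_degenerate_window("LSRMWCLSR", window=3))
```

Windows rich in Met and Trp (one codon each) give the lowest degeneracy, while Leu, Ser and Arg (six codons each) raise it sharply.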

The invention also provides nucleic acid hybridization probes and replication /
amplification primers having an EPAS1 cDNA specific sequence contained in SEQ ID NO:1
and sufficient to effect specific hybridization thereto (i.e. specifically hybridize with SEQ ID
NO:1 in the presence of endothelial cell cDNA). Such primers or probes are at least 12,
preferably at least 24, more preferably at least 36 and most preferably at least 96 bases in
length. Demonstrating specific hybridization generally requires stringent conditions, for
example, hybridizing in a buffer comprising 30% formamide in 5 x SSPE (0.18 M NaCl, 0.01
M NaPO4, pH 7.7, 0.001 M EDTA) buffer at a temperature of 42°C and remaining bound
when subject to washing at 42°C with 0.2 x SSPE; preferably hybridizing in a buffer
comprising 50% formamide in 5 x SSPE buffer at a temperature of 42°C and remaining
bound when subject to washing at 42°C with 0.2 x SSPE buffer. EPAS1 cDNA
homologs can also be distinguished from other proteins using alignment algorithms, such as
BLASTX (Altschul et al. (1990) Basic Local Alignment Search Tool, J Mol Biol 215,
403-410).
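
The hybridization and wash buffers above are stated as multiples of SSPE, so the actual salt concentrations follow by simple scaling. The sketch below assumes that the parenthetical composition (0.18 M NaCl, 0.01 M sodium phosphate, 0.001 M EDTA) describes 1 x SSPE, which is one common convention but is not stated explicitly in the text. The names used are hypothetical.

```python
# Assumed composition of 1 x SSPE, in mol/L (assumption: the parenthetical in the
# text gives the 1 x recipe rather than the 5 x working solution).
SSPE_1X = {"NaCl": 0.18, "NaPO4": 0.01, "EDTA": 0.001}


def sspe_molarity(fold: float) -> dict:
    """Scale the assumed 1 x SSPE recipe to the requested strength."""
    if fold <= 0:
        raise ValueError("buffer strength must be positive")
    return {component: molar * fold for component, molar in SSPE_1X.items()}


if __name__ == "__main__":
    # 5 x is the hybridization strength; 0.2 x is the wash strength.
    for label, fold in (("hybridization", 5.0), ("wash", 0.2)):
        scaled = sspe_molarity(fold)
        summary = ", ".join(f"{name} {value:.4f} M" for name, value in scaled.items())
        print(f"{label} ({fold} x SSPE): {summary}")
```

Under that assumption the wash step lowers the NaCl concentration 25-fold relative to hybridization, which is what makes the wash the more stringent step at the same temperature.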

The subject nucleic acids are of synthetic/non-natural sequences and/or are isolated,
i.e. unaccompanied by at least some of the material with which they are associated in their natural
state, preferably constituting at least about 0.5%, preferably at least about 5% by weight of
total nucleic acid present in a given fraction, and usually recombinant, meaning they comprise
a non-natural sequence or a natural sequence joined to nucleotide(s) other than that which it is
joined to on a natural chromosome. Nucleic acids comprising the nucleotide sequence of
SEQ ID NO:1 or 2 or fragments thereof, contain such sequence or fragment at a terminus,
immediately flanked by a sequence other than that which it is joined to on a natural
chromosome, or flanked by a native flanking region fewer than 10 kb, preferably fewer than 2
kb, which is at a terminus or is immediately flanked by a sequence other than that which it is
joined to on a natural chromosome. While the nucleic acids are usually RNA or DNA, it is
often advantageous to use nucleic acids comprising other bases or nucleotide analogs to
provide modified stability, etc.

The subject nucleic acids find a wide variety of applications including use as 
translatable transcripts, hybridization probes, PCR primers, diagnostic nucleic acids, etc.; use 
in detecting the presence of EPAS1 genes and gene transcripts and in detecting or amplifying 
nucleic acids encoding additional EPAS1 homologs and structural analogs. In diagnosis, 
EPAS1 hybridization probes find use in identifying wild-type and mutant EPAS1 alleles in 
clinical and laboratory samples. Mutant alleles are used to generate allele-specific 
oligonucleotide (ASO) probes for high-throughput clinical diagnoses. In therapy, 
therapeutic EPAS1 nucleic acids are used to modulate cellular expression or intracellular 
concentration or availability of active EPAS1. 

The invention provides efficient methods of identifying agents, compounds or lead
compounds for agents active at the level of an EPAS1 modulatable cellular function.
Generally, these screening methods involve assaying for compounds which modulate EPAS1
interaction with a natural EPAS1 binding target. A wide variety of assays for binding agents
are provided including labeled in vitro protein-protein binding assays, immunoassays, cell
based assays, etc. The methods are amenable to automated, cost-effective high throughput
screening of chemical libraries for lead compounds. Identified reagents find use in the
pharmaceutical industries for animal and human trials; for example, the reagents may be
derivatized and rescreened in in vitro and in vivo assays to optimize activity and minimize
toxicity for pharmaceutical development. Target indications include neoproliferative disease,
inflammation, hypersensitivity, wound healing, immune deficiencies, infection, etc.

In vitro binding assays employ a mixture of components including an EPAS1 protein,
which may be part of a fusion product with another peptide or polypeptide, e.g. a tag for
detection or anchoring, etc. The assay mixtures comprise a natural intracellular EPAS1
binding target. While native binding targets may be used, it is frequently preferred to use
portions (e.g. peptides) thereof so long as the portion provides binding affinity and avidity to
the subject EPAS1 protein conveniently measurable in the assay. The assay mixture also
comprises a candidate pharmacological agent. Candidate agents encompass numerous
chemical classes, though typically they are organic compounds, preferably small organic
compounds, and are obtained from a wide variety of sources including libraries of synthetic or
natural compounds. A variety of other reagents may also be included in the mixture. These
include reagents like salts, buffers, neutral proteins (e.g. albumin), detergents, protease
inhibitors, nuclease inhibitors, antimicrobial agents, etc.

The resultant mixture is incubated under conditions whereby, but for the presence of
the candidate pharmacological agent, the EPAS1 protein specifically binds the cellular
binding target, portion or analog with a reference binding affinity. The mixture components
can be added in any order that provides for the requisite bindings and incubations may be
performed at any temperature which facilitates optimal binding. Incubation periods are
likewise selected for optimal binding but also minimized to facilitate rapid, high-throughput
screening.

After incubation, the agent-biased binding between the EPAS1 protein and one or
more binding targets is detected by any convenient way. For cell-free binding type assays, a
separation step is often used to separate bound from unbound components. Separation may
be effected by precipitation (e.g. TCA precipitation, immunoprecipitation, etc.),
immobilization (e.g. on a solid substrate), etc., followed by washing by, for example,
membrane filtration (e.g. Whatman's P-81 ion exchange paper, Polyfiltronic's hydrophobic
GFC membrane, etc.), gel chromatography (e.g. gel filtration, affinity, etc.). For
EPAS1-dependent transcription assays, binding is detected by a change in the expression of an
EPAS1-dependent reporter.

Detection may be effected in any convenient way. For cell-free binding assays, one of
the components usually comprises or is coupled to a label. The label may provide for direct
detection as radioactivity, luminescence, optical or electron density, etc. or indirect detection
such as an epitope tag, an enzyme, etc. A variety of methods may be used to detect the label
depending on the nature of the label and other assay components, e.g. through optical or
electron density, radiative emissions, nonradiative energy transfers, etc. or indirectly detected
with antibody conjugates, etc.

A difference in the binding affinity of the EPAS1 protein to the target in the absence
of the agent as compared with the binding affinity in the presence of the agent indicates that
the agent modulates the binding of the EPAS1 protein to the EPAS1 binding target.
Analogously, in the cell-based transcription assay also described below, a difference in the
EPAS1 transcriptional induction in the presence and absence of an agent indicates the agent
modulates EPAS1-induced transcription. A difference, as used herein, is statistically
significant and preferably represents at least a 50%, more preferably at least a 90% difference.
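
As an illustration of the percentage thresholds above, the sketch below computes the difference between a binding (or reporter) measurement with and without a candidate agent. It assumes the difference is expressed relative to the no-agent measurement, which the text does not specify, and it does not test statistical significance, which would require replicate measurements. All names are hypothetical.

```python
def percent_difference(without_agent: float, with_agent: float) -> float:
    """Absolute change caused by the agent, as a percentage of the no-agent value."""
    if without_agent == 0:
        raise ValueError("no-agent measurement must be non-zero")
    return abs(with_agent - without_agent) * 100.0 / abs(without_agent)


def difference_tier(percent: float) -> str:
    """Map a percentage difference onto the two thresholds named in the text."""
    if percent >= 90.0:
        return "at least 90% (more preferred)"
    if percent >= 50.0:
        return "at least 50% (preferred)"
    return "below 50%"


if __name__ == "__main__":
    # Hypothetical readings in arbitrary units: agent reduces binding from 100 to 40.
    change = percent_difference(100.0, 40.0)
    print(change, difference_tier(change))
```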

The following experimental section and examples are offered by way of illustration 
and not by way of limitation. 

EXPERIMENTAL 

cDNAs encompassing the coding region of the human EPAS1 were isolated by screening
a HeLa cell cDNA library with a radiolabeled probe derived from an expressed sequence tag
(#T70415) obtained from the Genbank database (see Materials and Methods). Multiple cDNA
clones were isolated and subjected to DNA sequence analysis to derive the conceptually
translated protein sequence of human EPAS1 shown in Table 1. The predicted Mr of the human
EPAS1 was 96,528. A termination codon was located 24 nucleotides 5' of the designated
initiator methionine in the human sequence. cDNAs encoding the murine homologue were
isolated from an adult mouse brain cDNA library using a probe obtained by reverse transcriptase
polymerase chain reactions with oligonucleotide primers derived from the human EPAS1 cDNA
sequence (see Materials and Methods). The predicted protein sequence of murine EPAS1 is
aligned and compared with the human sequence in Table 1. The two proteins share 88%
sequence identity. Database searches revealed that the human and murine EPAS1 proteins share
extensive primary amino acid sequence identity with hypoxia inducible factor-1α (HIF-1α), a
member of the bHLH/PAS domain family of transcription factors (Wang et al. 1995; Wenger et
al. 1995). EPAS1 and HIF-1α share 48% primary amino acid sequence identity as revealed by
the alignment shown in Table 1. Sequence conservation between the two proteins is highest in
the basic-helix-loop-helix (85%), PAS-A (68%) and PAS-B (73%) regions. A second region of
sequence identity occurs at the extreme carboxy termini of the EPAS1 and HIF-1α proteins.
This conserved region in mHIF-1α has been recently shown to contain a hypoxia response domain
(Li et al., 1996). EPAS1 also shares sequence relatedness with other PAS domain proteins;
however, the degree of similarity between EPAS1 and other family members is less striking than
that between HIF-1α and EPAS1.
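
Percent identity figures such as those quoted above are derived from a pairwise alignment by counting identical residues. The sketch below shows one way to compute the figure from two already-aligned sequences. It assumes that columns containing a gap in either sequence are excluded from the denominator; the source does not say which convention was used, and other conventions give slightly different values. The function name and the test sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of ungapped alignment columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # skip columns with a gap in either sequence
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Hypothetical aligned fragments, not taken from Table 1.
    print(percent_identity("MKV-LA", "MRVQLA"))
```

The same function can be applied to a sub-range of the alignment (for example the columns spanning a single domain) to obtain per-domain figures.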

Genomic clones encoding the human EPAS1 transcript were isolated by screening
bacteriophage libraries of human DNA. The intron-exon structure of the gene was established
by comparison of DNA sequences obtained from the genomic DNA to that of the cDNA. The
coding region of EPAS1 is specified by 15 exons. The exonic sequences mapped to six
non-overlapping bacteriophage lambda clones whose average insert size was 20 kb, indicating
that the EPAS1 gene spans at least 120 kb of genomic DNA. A comparison of the EPAS1 gene
structure with that of the aryl hydrocarbon receptor (Schmidt et al. 1993) reveals that the
positions of introns within the regions encoding the amino-terminal halves of the two proteins
are highly conserved. In contrast, the portion of the EPAS1 gene specifying the carboxy-terminal
half of the protein is interrupted by seven introns, whereas the AHR gene contains only a single
intron in this region. Thus the 5'-ends of the two genes may have arisen from an ancient gene
duplication event, whereas the 3'-regions have a more recent evolutionary origin.

Two methods were used to determine the chromosomal location of the human EPAS1
gene. Fluorescent in situ hybridization (FISH) analysis was performed using a biotinylated probe
containing exons 8-14 of the EPAS1 gene. This analysis revealed a single hybridization signal
over chromosome 2, bands p16-p21. As a second assay for gene localization, an oligonucleotide
primer pair derived from exon 8 was used to amplify a segment of the EPAS1 gene from the
genomic DNAs of a radiation hybrid panel. Computer-assisted analysis of the results indicated
linkage of the EPAS1 gene to the D2S288 marker on chromosome 2p with a LOD score of 8.7
and a cR8000 value of 12.96. Thus, the data obtained from two independent mapping methods
consistently positioned the EPAS1 gene on the short arm of chromosome 2 and indicate that the
EPAS1 gene is non-syntenic with the HIF-1α gene, which maps to chromosome 14q21-24
(Semenza et al. 1996).
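
A LOD score is the base-10 logarithm of the odds in favor of linkage, so the reported score can be converted into an odds ratio directly. The sketch below performs that conversion; the function names are hypothetical, and the threshold of 3 mentioned in the comment is the conventional cutoff for declaring linkage rather than a value taken from this document.

```python
import math


def lod_to_odds(lod: float) -> float:
    """Convert a LOD score to the odds ratio in favor of linkage."""
    return 10.0 ** lod


def odds_to_lod(odds: float) -> float:
    """Convert an odds ratio back to a LOD score."""
    if odds <= 0:
        raise ValueError("odds must be positive")
    return math.log10(odds)


if __name__ == "__main__":
    # A LOD of 3 (odds of 1000:1) is the conventional cutoff for linkage.
    for lod in (3.0, 8.7):
        print(f"LOD {lod} -> odds of about {lod_to_odds(lod):.3g} to 1")
```

On this reading, the reported LOD score of 8.7 corresponds to odds of several hundred million to one in favor of linkage to the marker.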

The high degree of sequence similarity between the EPAS1 and HIF-1α proteins raises
the possibility that they share a common physiological function. To test this hypothesis, RNA
blotting experiments were used to compare and contrast the distributions of EPAS1 and HIF-1α
mRNAs in a variety of human tissues. An EPAS1 mRNA of approximately 5.8 kb was detected
in all tissues examined with the single exception of peripheral blood leukocytes. Among the
positive tissues, highly vascularized organs such as the heart, placenta and lung showed the
highest levels of EPAS1 mRNA. A HIF-1α mRNA of approximately 4.4 kb was detected in all
human tissues. In contrast to EPAS1 mRNA, however, peripheral blood leukocytes contained
very high levels of HIF-1α mRNA. Likewise, we observed no enrichment of HIF-1α mRNA in
highly vascularized tissues.

These RNA blotting data indicate that, with few exceptions, most tissues express both
EPAS1 and HIF-1α mRNAs. To determine if this overlap extended to the cellular level, in situ
mRNA hybridization was used to determine the cell type specific expression patterns of the two
gene products. Sections from day 11 and day 13 mouse embryos were examined first. In day
11 embryo sections, EPAS1 transcripts were observed almost exclusively in endothelial cells of
the intersegmental blood vessels separating the somites, the atrial and ventricular chambers of
the heart, and the dorsal aorta. Extra-embryonic membranes, such as the yolk sac, which are
highly vascularized, also expressed abundant levels of EPAS1 mRNA. In the developing brain
of a day 13 embryo, endothelial cells of the highly vascularized choroid plexus contained
abundant EPAS1 transcripts. The brain section also revealed intense EPAS1 mRNA
hybridization in the endothelial cells of a blood vessel lying along the edge of post-mitotic
neurons emanating from the lateral ventricle region. When a nearby section was hybridized with
an anti-sense probe that was specific for the HIF-1α mRNA, only a diffuse signal somewhat over
background was detected, indicating a low level of HIF-1α expression in many cell types. In
contrast to the results with the EPAS1 probe, no concentration of HIF-1α mRNA was detected
in the endothelial cells of the adjacent blood vessel. A differential expression pattern between
EPAS1 and HIF-1α was also apparent in the region of the embryo containing the umbilicus.
EPAS1 transcripts were detected in the endothelium of blood vessels within this structure,
whereas HIF-1α mRNA was concentrated in the mesenchyme surrounding the vascular
endothelium.

In tissues of adult mice, EPAS1 mRNA was also detected at high levels in endothelial
cells, yet was also present at lower levels in several additional cell types. For example, decidual
cells of the placenta contained very high levels of EPAS1 mRNA as did parenchymal tissue in
the lung. The distinction between EPAS1 expressing cell types and HIF-1α expressing cells was
also apparent in adult tissues. A section through the cortex of the kidney showed EPAS1
expression in the mesangial cells. In contrast, HIF-1α expression was found in the cells of the
collecting ducts. Taken together, these in situ mRNA hybridization results reveal very divergent
patterns of EPAS1 and HIF-1α mRNA distribution.

The presence of basic helix-loop-helix and PAS domain motifs in EPAS1 raised the
possibility that this protein might be capable of forming a complex with the aryl hydrocarbon
receptor nuclear transport protein (ARNT) (Hoffman et al. 1991), and that the resulting
heterodimer might exhibit sequence-specific DNA binding. To test these predictions, EPAS1 and
ARNT expression vectors were used to program a reticulocyte lysate. The EPAS1 expression
vector was modified at its carboxy-terminus with a c-Myc epitope tag to facilitate immunological
detection of the EPAS1 translation product. Radiolabeled methionine was included in the
translation mix containing the ARNT mRNA, whereas unlabeled methionine was used in the
EPAS1 reaction. After translation, the two reactions were mixed and subsequently incubated
with a monoclonal antibody that recognizes the c-Myc epitope present on the EPAS1 protein.
Under these conditions the c-Myc antibody was capable of immunoprecipitating the radiolabeled
ARNT protein only when EPAS1-Myc protein was present in the reaction.

The bHLH domains of HIF-1α and EPAS1 are nearly identical in primary amino acid
sequence. Thus, to test for the ability of EPAS1 to form a functional heterodimer with ARNT,
we used a HIF-1α response element derived from the 3'-flanking region of the erythropoietin
gene (Semenza and Wang, 1992) in gel mobility shift assays with in vitro translated proteins.
The data showed that a new complex was formed when both EPAS1 and ARNT were included
in the DNA binding reaction, and that this complex was specifically recognized by an
anti-peptide antibody directed against the EPAS1 protein. Competition experiments using a
100-fold excess of unlabeled competitor DNA containing the HIF-1α response element, or a
response element with three point mutations in this sequence, indicated that EPAS1 exhibited
sequence-specific binding properties. Taken together, the data indicate that EPAS1 is capable
of binding the HIF-1α response element in the presence of the ARNT protein.

The ability of EPAS1 to trans-activate a reporter gene containing the HIF-1α response
element was tested by transient transfection. Expression vectors in which either EPAS1, HIF-1α,
or ARNT was placed under the control of a cytomegalovirus promoter were constructed. Two
luciferase reporter constructs were prepared. One contained nucleotides -105 through +58 of the
herpes simplex virus thymidine kinase promoter (McKnight et al. 1981) linked to three copies
of the HIF-1α response element from the erythropoietin gene (pRE-tk-LUC). The other
contained a TATA sequence from the adenovirus major late gene promoter (Lillie and Green,
1989) linked to the same three HIF-1α response elements (pE1B-LUC). Combinations of these
plasmids were then transfected into cultured human embryonic kidney 293 cells and the
expression of luciferase enzyme activity was monitored in cell lysates 16-20 hours
post-transfection. The data showed that EPAS1 induced a 12-fold increase in luciferase enzyme
activity when transfected in the absence of the ARNT vector. Cotransfection of the ARNT
expression vector with low levels of EPAS1 expression vector did not increase the
EPAS1-mediated induction of luciferase activity, suggesting that this cell line might contain
adequate amounts of endogenous ARNT to support heterodimer formation with EPAS1. A
seven-fold stimulation of luciferase activity was also obtained when larger amounts of the
HIF-1α expression plasmid were introduced into 293 cells. The introduction of three point mutations
into the core sequence of the hypoxia response element eliminated both EPAS1-dependent and
HIF-1α-dependent activation of the reporter gene.

WO98/31701    PCT/US98/00813

The potential of HIF-1α to induce expression of target genes is increased by both hypoxia
and pharmacological compounds that mimic hypoxia in cells, such as desferoxamine (DFX) and
cobalt chloride (CoCl2) (Wang et al. 1995). To determine if EPAS1 activity might also be
stimulated by these agents, 293 cells were incubated under hypoxic conditions or treated with
DFX or CoCl2 prior to transfection with the plasmids. Pretreatment of cells under conditions that
mimic hypoxia increased expression from the luciferase construct in the absence of exogenous
EPAS1 or HIF-1α. This trans-activation presumably arises from endogenous HIF-1α or EPAS1
proteins whose mRNAs are present in 293 cells. As noted above, introduction of the EPAS1
expression vector led to 5- to 10-fold higher levels of luciferase activity over those seen in
mock-transfected cells. An extra 2- to 4-fold stimulation of luciferase expression was observed
upon pretreatment with CoCl2, DFX, or hypoxia relative to that measured in EPAS1-transfected
but untreated cells. Of the three conditions, pretreatment with CoCl2 led to a slightly larger
increase in EPAS1 activity, resulting in a four-fold higher level of luciferase activity over that
detected in untreated cells. As has been observed in previous studies (Jiang et al. 1996; Forsythe
et al. 1996), hypoxic conditions also stimulated the ability of HIF-1α to trans-activate the target
gene containing the hypoxia response element.

The EPAS1 expression vector was also tested for its ability to activate a reporter gene
(pRE-Elb-LUC) following transfection into murine hepatoma cells (Hepa1c1c7) that express
ARNT, as well as in a mutant line derived from these parental cells that does not express ARNT
(c4 variant, Legraverend et al. 1982). Expression of EPAS1 in the Hepa1c1c7 cells led to a
nine-fold increase in luciferase activity. Transfection of EPAS1 alone into c4 cells increased
luciferase enzyme activity only slightly (1.8-fold), whereas cotransfection of EPAS1 and ARNT
led to a 12-fold stimulation of activity. These findings are consistent with the interpretation that
EPAS1 forms an active heterodimeric transcription factor with ARNT, and they confirm the
results showing heterodimerization of these two proteins obtained in coimmunoprecipitation and
gel mobility shift assays.
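
The fold-stimulation values quoted in these reporter experiments can be illustrated with a minimal sketch. It assumes each well yields a luciferase reading and a β-galactosidase reading from a cotransfected control vector; the function names and the numerical readings are hypothetical and are not taken from the experiments described here.

```python
def normalized_activity(luciferase: float, beta_gal: float) -> float:
    """Luciferase signal corrected for transfection efficiency (beta-gal signal)."""
    if beta_gal <= 0:
        raise ValueError("beta-galactosidase reading must be positive")
    return luciferase / beta_gal


def fold_stimulation(sample: tuple, control: tuple) -> float:
    """Ratio of normalized activity in a sample well to that in a control well."""
    return normalized_activity(*sample) / normalized_activity(*control)


# Hypothetical (luciferase, beta-gal) readings in arbitrary light units.
mock_transfected = (1_000.0, 500.0)
epas1_transfected = (12_000.0, 600.0)

print(fold_stimulation(epas1_transfected, mock_transfected))
```

Dividing by the β-galactosidase signal before taking the ratio keeps differences in transfection efficiency between wells from being mistaken for trans-activation.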

The experiments demonstrating the functional activity of EPAS1 utilized a hypoxia
response element derived from the erythropoietin gene, which is a known target gene for HIF-1α
(Semenza and Wang, 1992). Despite the activity of EPAS1 in these assays, as well as the high
degree of sequence similarity between HIF-1α and EPAS1, the in situ mRNA hybridization
results indicate that the two proteins are expressed in different cell types and thus might activate
different target genes. The high level of expression of EPAS1 in endothelial cells raises the
possibility that the EPAS1 protein might activate genes whose expression is limited to
endothelial cells. To test this hypothesis, we transfected 293 cells with a c-Myc-tagged EPAS1
expression vector and a marker gene composed of the 5'-flanking region of the Tie-2 gene linked
to β-galactosidase. Tie-2 encodes a tyrosine kinase receptor that is specifically expressed in cells
of endothelial lineage (Dumont et al. 1992; Maisonpierre et al. 1993; Sato et al. 1993; Schnurch
and Risau, 1993). The data showed that EPAS1 potently stimulated expression of the
Tie-2-driven reporter gene, and that the degree of stimulation correlated with the level of
immunodetectable EPAS1 in the transfected cells. Surprisingly, little or no transcriptional
activation of the Tie-2 reporter gene by HIF-1α was detected, even though equivalent amounts
of HIF-1α and EPAS1 proteins were expressed in the 293 cells.

These data reveal that EPAS1 proteins and nucleic acids provide reagents to modulate the
formation of endothelial tissues, including vasculature, the blood-brain barrier, etc., and to
modulate cellular or tissue responsiveness to oxygenation, hypoxia and other hemodynamic
stimuli.

cDNA and genomic cloning, chromosomal mapping 
In the course of screening for genes that are differentially expressed in prostate
adenocarcinoma versus normal tissue, a cDNA encoding a bHLH/PAS domain protein was
isolated. Database searches generated several expressed sequence tags that showed sequence
similarity to this family of transcription factors. EPAS1 cDNAs correspond to the human
expressed sequence tag #T70415 in the GenBank collection and were isolated by a combination
of reverse transcriptase polymerase chain reactions and screening of a HeLa cell cDNA library
(Yokoyama et al. 1993) using standard methods. Similar approaches were used to isolate the
murine homologue from a commercially available mouse adult brain cDNA library (#837314,
Stratagene Corp., La Jolla, CA). A human HIF-1α cDNA was generated by ligation of an
amplified cDNA fragment to expressed sequence tag hbc025 (Takeda et al. 1993). Bacteriophage
λ clones harboring genomic DNA inserts corresponding to the human EPAS1 gene were isolated
by screening a commercially available fibroblast genomic library (λFIXII vector, #946204,
Stratagene Corp.).

Fluorescence in situ hybridization to identify the chromosomal localization of the human
EPAS1 gene was carried out as previously described (Craig and Bickmore, 1994). This analysis
indicated hybridization to the short arm of chromosome 2, bands p16-21. To confirm the
assignment, a 269 bp segment of exon 8 from the EPAS1 gene was amplified from the 83
genomic DNAs of a radiation hybrid panel (Stanford G3 panel, Research Genetics, Huntsville,
AL) using oligonucleotide primers and a thermocycler program consisting of 35 cycles of 94°C/1
min, 68°C/1 min. Analysis of the results via an e-mail server at Stanford University indicated
linkage to the D2S288 marker (logarithm of the odds score of 8.7, cR_8000 value of 12.96),
which is located approximately 82 centimorgans from the telomere of the short arm of
chromosome 2 (MIT Center for Genome Research).
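
As a brief illustration of what a logarithm of the odds (LOD) score expresses, the sketch below converts between a LOD score and the underlying likelihood ratio, using the standard definition LOD = log10(likelihood of linkage / likelihood of no linkage). The function names are hypothetical, and the sketch does not reproduce the radiation hybrid mapping software itself.

```python
import math


def lod_score(likelihood_linked: float, likelihood_unlinked: float) -> float:
    """LOD = log10 of the ratio of the two likelihoods."""
    return math.log10(likelihood_linked / likelihood_unlinked)


def odds_from_lod(lod: float) -> float:
    """Odds in favor of linkage implied by a given LOD score."""
    return 10.0 ** lod


# The LOD score of 8.7 reported for D2S288 corresponds to odds of roughly
# 5 x 10^8 to 1 in favor of linkage.
print(f"{odds_from_lod(8.7):.3g}")
```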
RNA blotting and in situ hybridization 

Human multiple tissue RNA blots (Clontech Laboratories, Palo Alto, CA) were probed
with EPAS1 and HIF-1α cDNA probes using Rapid-Hyb from Amersham Corp. (Arlington
Heights, IL). For in situ mRNA hybridization, mouse tissues were fixed in 4%
paraformaldehyde, sectioned at 5 µm thickness, and subjected to in situ mRNA hybridization as
described (Berman et al. 1995). A [»P]-labeled antisense RNA probe recognizing the EPAS1
mRNA was derived by in vitro transcription of an ~300 bp DNA fragment encoding amino acids
225-327 of the sequence shown in Table 1. A segment of the murine HIF-1α cDNA encoding
amino acids 41-125 was isolated by reverse transcriptase-polymerase chain reactions using
mRNA template isolated from embryonic day 10 mouse embryo.
Co-immunoprecipitation experiments 

Human EPAS1 and mouse ARNT proteins were generated in vitro using a
transcription-translation kit (TNT System, Promega Corp., Madison, WI). cDNAs encoding
full-length proteins were subcloned into the pcDNA3 vector (Invitrogen Corp., San Diego, CA) prior
to coupled transcription/translation. For immunoprecipitation, approximately 5 µl of each
reaction were transferred to a separate tube, mixed well and subsequently diluted by the addition
of 500 µl of ice-cold buffer (20 mM Hepes-KOH, pH 7.4/ 100 mM KCl/ 10% (v/v) glycerol/
0.4% (v/v) Nonidet P-40/ 5 mM EGTA/ 5 mM EDTA/ 100 µg/ml bovine serum albumin/ 1 mM
dithiothreitol) (Huang et al. 1993). The diluted mixture was incubated with 1 µl (0.1 µg) of
anti-Myc monoclonal antibody 9E10 (Santa Cruz Biotechnology, Santa Cruz, CA) for 2 hours
at 4°C. A 10 µl aliquot of beads (~4 × 10⁶ in number, Dynal Corp., Lake Success, NY) coated
with rat anti-mouse IgG1 antibody was then added, followed by a further incubation for 1 hour
at 4°C. Beads were washed three times with 1.5 ml of the above buffer and bound proteins were
subsequently analyzed by electrophoresis through 8% polyacrylamide gels containing SDS.
Gel retention assays 

EPAS1 and ARNT cDNAs were translated in vitro as described above. Gel retention
assays were performed as described previously (Semenza and Wang, 1992) using a
double-stranded oligonucleotide probe radiolabeled with the Klenow fragment of E. coli DNA
polymerase I and containing an HIF-1α binding site (5'-GCCCTACGTGCTGTCTCA-3', SEQ
ID NO:3) from the erythropoietin gene (Semenza and Wang, 1992). For supershift assays, a
polyclonal antibody was raised against residues 1 to 10 of the human EPAS1 protein by standard
methods and 1 µl of serum was added to the gel retention reaction mixture prior to the 30 minute
incubation at 4°C. A preimmune serum served as a negative antibody control.
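
The probe sequence given above (SEQ ID NO:3) can be inspected with a short sketch. The core motif `ACGTG` used here is an assumption based on the consensus commonly cited for hypoxia response elements; it is not stated in this specification, and the helper names are hypothetical.

```python
HRE_PROBE = "GCCCTACGTGCTGTCTCA"  # SEQ ID NO:3, top strand, 5' to 3'
CORE_MOTIF = "ACGTG"              # assumed core motif, not taken from the text

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Sequence of the opposite strand, read 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


print(len(HRE_PROBE))                  # probe length in bases
print(HRE_PROBE.find(CORE_MOTIF))      # 0-based offset of the assumed core motif
print(reverse_complement(HRE_PROBE))   # bottom strand of the double-stranded probe
print(round(gc_fraction(HRE_PROBE), 2))
```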
Transient transfection assays 

The pTK-RE3-luc reporter plasmid was constructed by inserting three copies of a
50-nucleotide hypoxia-inducible enhancer from the erythropoietin gene (Semenza and Wang,
1992) into pGL3-TK. The Tie-2-β-galactosidase reporter gene pT2HLacZpAlI.7, containing
10.3 kb of 5'-flanking DNA from the murine Tie-2 gene, was obtained from the Cardiovascular
Division, Beth Israel Hospital, Boston, MA. Human embryonic kidney 293 cells (ATCC
CRL#1573) were cultured in Dulbecco's modified Eagle's medium (DMEM, low glucose;
Gibco-BRL) supplemented with 10% fetal calf serum. The murine hepatoma cell line Hepa1c1c7
and the c4 ARNT-deficient mutant derived from this line were maintained as described
previously (Legraverend et al. 1982). Approximately 24 hours before transfection, cells were
inoculated in 12-well plates at a density of 120,000 cells per well. Plasmid DNA (1-10 µg) was
transfected into cells using a kit (MBS, Stratagene Corp., La Jolla, CA). Cells were allowed to
recover for 3 hours at 35°C in a 3% CO₂ atmosphere. Where indicated, 125 µM CoCl₂ (#C3169,
Sigma Chem. Corp., St. Louis, MO) or 130 µM desferoxamine (#D9533, Sigma) was added
to cells at this time and the incubation continued for an additional 16 hours in atmospheres
containing 20% or 1% O₂. Luciferase and β-galactosidase enzyme activities were determined
according to the manufacturer's instructions (Tropix, Bedford, MA). Reporter gene expression
was normalized by cotransfection of a β-galactosidase expression vector (pCMV-β-gal) and/or
to expression obtained from the pGL3-Control plasmid (Promega Corp., Madison, WI). Levels
of expressed c-Myc epitope-tagged EPAS1 or HIF-1α were assessed by immunoblotting with
the anti-Myc monoclonal antibody 9E10 (Santa Cruz Biotechnology, Santa Cruz, CA) using a
protocol supplied by the manufacturer.

EXAMPLES

1. Protocol for high throughput EPAS1-ARNT complex formation assay.
A. Reagents:

- Neutralite Avidin: 20 µg/ml in PBS.

- Blocking buffer: 5% BSA, 0.5% Tween 20 in PBS; 1 hour at room temperature.

- Assay Buffer: 100 mM KCl, 20 mM HEPES pH 7.6, 1 mM MgCl₂, 1% glycerol, 0.5%
NP-40, 50 mM β-mercaptoethanol, 1 mg/ml BSA, cocktail of protease inhibitors.

- ³³P EPAS1 protein 10x stock: 10⁻⁸ - 10⁻⁶ M "cold" EPAS1 supplemented with 200,000-
250,000 cpm of labeled EPAS1 (Beckman counter). Place in the 4°C microfridge during
screening.

- Protease inhibitor cocktail (1000X): 10 mg Trypsin Inhibitor (BMB # 109894), 10 mg
Aprotinin (BMB # 236624), 25 mg Benzamidine (Sigma # B-6506), 25 mg Leupeptin (BMB #
1017128), 10 mg APMSF (BMB # 917575), and 2 mM NaVO₃ (Sigma # S-6508) in 10 ml of PBS.

- ARNT: 10⁻⁷ - 10⁻⁵ M biotinylated ARNT in PBS.

B. Preparation of assay plates: 

- Coat with 120 µl of stock N-Avidin per well overnight at 4°C.

- Wash 2 times with 200 µl PBS.

- Block with 150 µl of blocking buffer.

- Wash 2 times with 200 µl PBS.

C. Assay: 

- Add 40 µl assay buffer/well.

- Add 10 µl compound or extract.

- Add 10 µl ³³P-EPAS1 protein (20,000-25,000 cpm/0.1-10 pmoles/well = 10⁻⁹ - 10⁻⁷ M final
conc.).

- Shake at 25°C for 15 minutes.

- Incubate additional 45 minutes at 25°C.

- Add 40 µl biotinylated ARNT (0.1-10 pmoles/40 µl in assay buffer).

- Incubate 1 hour at room temperature.

- Stop the reaction by washing 4 times with 200 µl PBS.

- Add 150 µl scintillation cocktail.

- Count in Topcount.
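
The relationship between the 10x labeled-protein stock and the final assay concentration can be checked with a minimal sketch. It assumes the final well volume is simply the sum of the four additions listed in the assay steps (40 + 10 + 10 + 40 µl); the function name is hypothetical.

```python
def final_concentration(stock_molar: float, added_ul: float, total_ul: float) -> float:
    """Concentration after diluting `added_ul` of stock into a `total_ul` reaction."""
    if total_ul <= 0 or added_ul < 0 or added_ul > total_ul:
        raise ValueError("volumes must satisfy 0 <= added_ul <= total_ul, total_ul > 0")
    return stock_molar * added_ul / total_ul


# Assay buffer + compound + labeled protein + biotinylated binding partner.
total_volume_ul = 40 + 10 + 10 + 40

# The 10x stock range given in the reagent list (10^-8 to 10^-6 M).
for stock in (1e-8, 1e-6):
    print(f"{stock:.0e} M stock -> {final_concentration(stock, 10, total_volume_ul):.0e} M final")
```

Under this assumption, adding 10 µl of stock to a 100 µl reaction is a ten-fold dilution, which agrees with the 10⁻⁹ to 10⁻⁷ M final concentration range stated in the protocol.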

D. Controls for all assays (located on each plate): 

a. Non-specific binding 

b. Soluble (non-biotinylated EPAS1) at 80% inhibition.

2. Protocol for high throughput human EPAS1/ARNT-DNA complex formation assay.

A. Reagents: 

- Neutralite Avidin: 20 µg/ml in PBS.

- Blocking buffer: 5% BSA, 0.5% Tween 20 in PBS; 1 hour at room temperature.

- Assay Buffer: 100 mM KCl, 20 mM HEPES pH 7.6, 1 mM MgCl₂, 1% glycerol, 0.5%
NP-40, 50 mM β-mercaptoethanol, 1 mg/ml BSA, cocktail of protease inhibitors.

- ³³P human EPAS1 protein 10x stock: 10⁻⁸ - 10⁻⁶ M "cold" human EPAS1 subunit (p105)
supplemented with 200,000-250,000 cpm of labeled human EPAS1 (Beckman counter). Place
in the 4°C microfridge during screening.

- Protease inhibitor cocktail (1000X): 10 mg Trypsin Inhibitor (BMB # 109894), 10 mg
Aprotinin (BMB # 236624), 25 mg Benzamidine (Sigma # B-6506), 25 mg Leupeptin (BMB #
1017128), 10 mg APMSF (BMB # 917575), and 2 mM NaVO₃ (Sigma # S-6508) in 10 ml of PBS.

- DNA: 10⁻⁷ - 10⁻⁴ M biotinylated DNA (SEQ ID NO:3) in PBS.

- ARNT protein: 10⁻⁷ - 10⁻⁵ M ARNT in PBS.

B. Preparation of assay plates: 

- Coat with 120 µl of stock N-Avidin per well overnight at 4°C.

- Wash 2 times with 200 µl PBS.

- Block with 150 µl of blocking buffer.

- Wash 2 times with 200 µl PBS.

C. Assay: 

- Add 40 µl assay buffer/well.

- Add 10 µl compound or extract.

- Add 10 µl ³³P-hEPAS1 protein (20,000-25,000 cpm/0.1-10 pmoles/well = 10⁻⁹ - 10⁻⁷ M final).

- Add 10 µl ARNT protein.

- Shake at 25°C for 15 minutes.

- Incubate additional 45 minutes at 25°C.

- Add 40 µl biotinylated DNA (0.1-10 pmoles/40 µl in assay buffer).

- Incubate 1 hour at room temperature.

- Stop the reaction by washing 4 times with 200 µl PBS.

- Add 150 µl scintillation cocktail.

- Count in Topcount.

D. Controls for all assays (located on each plate):

a. Non-specific binding

b. Soluble (non-biotinylated EPAS1/ARNT combination) at 80% inhibition.
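
One way the plate controls might be used to score a test compound is sketched below. It assumes that, in addition to the non-specific binding control, an uninhibited (no compound) well is available to define total binding; that well, the function name and the counts shown are hypothetical and are not specified in the protocols above.

```python
def percent_inhibition(sample_cpm: float, total_cpm: float, nonspecific_cpm: float) -> float:
    """Percent loss of specific binding relative to an uninhibited well.

    Specific binding is the measured counts minus the non-specific binding control.
    """
    specific_total = total_cpm - nonspecific_cpm
    if specific_total <= 0:
        raise ValueError("total binding must exceed non-specific binding")
    specific_sample = sample_cpm - nonspecific_cpm
    return 100.0 * (1.0 - specific_sample / specific_total)


# Hypothetical Topcount readings (cpm) for a single plate.
total_binding = 10_000.0       # no compound added (assumed control)
nonspecific_binding = 500.0    # control (a)
competitor_well = 2_400.0      # soluble competitor, control (b)

print(round(percent_inhibition(competitor_well, total_binding, nonspecific_binding), 1))
```

With these illustrative counts the competitor well comes out at the 80% inhibition level named for control (b), which gives a reference point for judging compound wells on the same plate.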

All publications and patent applications cited in this specification are herein incorporated 
by reference as if each individual publication or patent application were specifically and 
individually indicated to be incorporated by reference. Although the foregoing invention has 
been described in some detail by way of illustration and example for purposes of clarity of 
understanding, it will be readily apparent to those of ordinary skill in the art in light of the 
teachings of this invention that certain changes and modifications may be made thereto without 
departing from the spirit or scope of the appended claims.

SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: McKnight, Steven L.
Russell, David W.
Tian, Hui

(ii) TITLE OF INVENTION: Endothelial PAS Domain Protein

(iii) NUMBER OF SEQUENCES: 7

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP
(B) STREET: 268 BUSH STREET, SUITE 3200
(C) CITY: SAN FRANCISCO
(D) STATE: CALIFORNIA
(E) COUNTRY: USA
(F) ZIP: 94104

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: US 08/785,241
(B) FILING DATE: 17-JAN-1997
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: OSMAN, RICHARD A
(B) REGISTRATION NUMBER: 36,627
(C) REFERENCE/DOCKET NUMBER: UTSD:1229

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (415) 343-4341
(B) TELEFAX: (415) 343-4342

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2816 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CCTGACTGCG CGGGGCGCTC GGGACCTGCG CGCACCTCGG ACCTTCACCA CCCGCCCGGG  60
CCGCGGGGAG CGGACGAGGG CCACAGCCCC CCACCCGCCA GGGAGCCCAG GTGCTCGGCG  120
TCTGAACGTC TCAAAGGGCC ACAGCGACAA TGACAGCTGA CAAGGAGAAG AAAAGGAGTA  180
GCTCGGAGAG GAGGAAGGAG AAGTCCCGGG ATGCTGCGCG GTGCCGGCGG AGCAAGGAGA  240
CGGAGGTGTT CTATGAGCTG GCCCATGAGC TGCCTCTGCC CCACAGTGTG AGCTCCCATC  300
TGGACAAGGC CTCCATCATG CGACTGGAAA TCAGCTTCCT GCGAACACAC AAGCTCCTCT  360
CCTCAGTTTG CTCTGAAAAC GAGTCCGAAG CCGAAGCTGA CCAGCAGATG GACAACTTGT  420
ACCTGAAAGC CTTGGAGGGT TTCATTGCCG TGGTGACCCA AGATGGCGAC ATGATCTTTC  480
TGTCAGAAAA CATCAGCAAG TTCATGGGAC TTACACAGGT GGAGCTAACA GGACATAGTA  540
TCTTTGACTT CACTCATCCC TGCGACCATG AGGAGATTCG TGAGAACCTG AGTCTCAAAA  600
ATGGCTCTGG TTTTGGGAAA AAAAGCAAAG ACATGTCCAC AGAGCGGGAC TTCTTCATGA  660
GGATGAAGTG CACGGTCACC AACAGAGGCC GTACTGTCAA CCTCAAGTCA GCCACCTGGA  720
AGGTCTTGCA CTGCACGGGC CAGGTGAAAG TCTACAACAA CTGCCCTCCT CACAATAGTC  780
TGTGTGGCTA CAAGGAGCCC CTGCTGTCCT GCCTCATCAT CATGTGTGAA CCAATCCAGC  840
ACCCATCCCA CATGGACATC CCCCTGGATA GCAAGACCTT CCTGAGCCGC CACAGCATGG  900
ACATGAAGTT CACCTACTGT GATGACAGAA TCACAGAACT GATTGGTTAC CACCCTGAGG  960
AGCTGCTTGG CCGCTCAGCC TATGAATTCT ACCATGCGCT AGACTCCGAG AACATGACCA  1020
AGAGTCACCA GAACTTGTGC ACCAAGGGTC AGGTAGTAAG TGGCCAGTAC CGGATGCTCG  1080
CAAAGCATGG GGGCTACGTG TGGCTGGAGA CCCAGGGGAC GGTCATCTAC AACCCTCGCA  1140
ACCTGCAGCC CCAGTGCATC ATGTGTGTCA ACTACGTCCT GAGTGAGATT GAGAAGAATG  1200
ACGTGGTGTT CTCCATGGAC CAGACTGAAT CCCTGTTCAA GCCCCACCTG ATGGCCATGA  1260
ACAGCATCTT TGATAGCAGT GGCAAGGGGG CTGTGTCTGA GAAGAGTAAC TTCCTATTCA  1320
CCAAGCTAAA GGAGGAGCCC GAGGAGCTGG CCCAGCTGGC TCCCACCCCA GGAGACGCCA  1380
TCATCTCTCT GGATTTCGGG AATCAGAACT TCGAGGAGTC CTCAGCCTAT GGCAAGGCCA  1440
TCCTGCCCCC GAGCCAGCCA TGGGCCACGG AGTTGAGGAG CCACAGCACC CAGAGCGAGG  1500
CTGGGAGCCT GCCTGCCTTC ACCGTGCCCC AGGCAGCTGC CCCGGGCAGC ACCACCCCCA  1560
GTGCCACCAG CAGCAGCAGC AGCTGCTCCA CGCCCAATAG CCCTGAAGAC TATTACACAT  1620
CTTTGGATAA CGACCTGAAG ATTGAAGTGA TTGAGAAGCT CTTCGCCATG GACACAGAGG  1680
CCAAGGACCA ATGCAGTACC CAGACGGATT TCAATGAGCT GGACTTGGAG ACACTGGCAC  1740
CCTATATCCC CATGGACGGG GAAGACTTCC AGCTAAGCCC CATCTGCCCC GAGGAGCGGC  1800
TCTTGGCGGA GAACCCACAG TCCACCCCCC AGCACTGCTT CAGTGCCATG ACAAACATCT  1860
TCCAGCCACT GGCCCCTGTA GCCCCGCACA GTCCCTTCCT CCTGGACAAG TTTCAGCAGC  1920
AGCTGGAGAG CAAGAAGACA GAGCCCGAGC ACCGGCCCAT GTCCTCCATC TTCTTTGATG  1980
CCGGAAGCAA AGCATCCCTG CCACCGTGCT GTGGCCAGGC CAGCACCCCT CTCTCTTCCA  2040
TGGGGGGCAG ATCCAATACC CAGTGGCCCC CAGATCCACC ATTACATTTT GGGCCCACAA  2100
AGTGGGCCGT CGGGGATCAG CGCACAGAGT TCTTGGGAGC AGCGCCGTTG GGGCCCCCTG  2160
TCTCTCCACC CCATGTCTCC ACCTTCAAGA CAAGGTCTGC AAAGGGTTTT GGGGCTCGAG  2220
GCCCAGACGT GCTGAGTCCG GCCATGGTAG CCCTCTCCAA CAAGCTGAAG CTGAAGCGAC  2280
AGCTGGAGTA TGAAGAGCAA GCCTTCCAGG ACCTGAGCGG GGGGGACCCA CCTGGTGGCA  2340
GCACCTCACA TTTGATGTGG AAACGGATGA AGAACCTCAG GGGTGGGAGC TGCCCTTTGA  2400
TGCCGGACAA GCCACTGAGC GCAAATGTAC CCAATGATAA GTTCACCCAA AACCCCATGA  2460
GGGGCCTGGG CCATCCCCTG AGACATCTGC CGCTGCCACA GCCTCCATCT GCCATCAGTC  2520
CCGGGGAGAA CAGCAAGAGC AGGTTCCCCC CACAGTGCTA CGCCACCCAG TACCAGGACT  2580
ACAGCCTGTC GTCAGCCCAC AAGGTGTCAG GCATGGCAAG CCGGCTGCTC GGGCCCTCAT  2640
TTGAGTCCTA CCTGCTGCCC GAACTGACCA GATATGACTG TGAGGTGAAC GTGCCCGTGC  2700
TGGGAAGCTC CACGCTCCTG CAAGGAGGGG ACCTCCTCAG AGCCCTGGAC CAGGCCACCT  2760
GAGCCAGGCC TTCTACCTGG GCAGCACCTC TGCCGACGCC GTCCCACCAG CTTCAC  2816



(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3031 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

CGACAGAGAG CTGCGGAGGG CCACAGCAAA GAGAGCGGCT GCAGCCCCTA CGGGGTTAAG  60
GAACCCAGGT GCTCCGGGTC TCGGAGGGCC ACGGCGACAA TGACAGCTGA CAAGGAGAAA  120
AAAAGGAGCA GCTCAGAGCT GAGGAAGGAG AAATCCCGTG ATGCCGCGAG GTGCCGGCGC  180
AGCAAGGAGA CGGAGGTCTT CTATGAGTTG GCTCATGAGT TGCCCCTGCC TCACAGTGTG  240
AGCTCCCACC TGGACAAAGC CTCCATCATG CGCCTGGCCA TCAGCTTCCT TCGGACACAT  300
AAGCTCCTGT CCTCAGTCTG CTCTGAAAAT GAATCTGAAG CTGAGGCCGA CCAGCAAATG  360
GATAACTTGT ACCTGAAAGC CTTGGAGGGT TTCATTGCTG TGGTGACCCA AGACGGTGAC  420
ATGATCTTTC TGTCGGAAAA CATCAGCAAG TTCATGGGAC TTACTCAGGT AGAACTAACA  480
GGACACAGCA TCTTTGACTT CACTCATCCT TGCGACCATG AAGAGATCCG TGAGAACCTG  540
ACTCTCAAAA ACGGCTCTGG TTTTGGGAAG AAGAGCAAAG ACGTGTCCAC CGAGCGTGAC  600
TTCTTCATGA GGATGAAGTG CACGGTCACC AACAGAGGCC GGACTGTCAA CCTCAAGTCG  660
GCCACCTGGA AGTCCGTCCT GCACTGCACC GGGCAAGTGA GAGTCTACAA CAACTGCCCC  720
CCTCACAGTA GCCTCTGTGG CTCCAAGGAG CCCCTGCTGT CCTGCCTTAT CATCATGTGT  780
GAGCCAATCC AGCACCCATC CCACATGGAC ATCCCCCTGG ACAGCAAGAC TTTCCTGAGC  840
CGCCACAGCA TGGACATGAA GTTCACCTAC TGTGACGACA GAATCTTGGA ACTGATTGGT  900
TACCACCCCG AGGAGCTACT TGGACGCTCT GCCTATGAGT TTTACCATGC CCTGGATTCG  960
GAGAACATGA CCAAAAGTCA CCAGAACTTG TGCACCAAGG GGCAGGTGGT ATCTGGCCAG  1020
TACCGGATGC TAGCCAAACA CGGAGGATAT GTGTGGCTGG AGACCCAGGG GACGGTCATC  1080
TACAACCCCC GCAACCTGCA GCCTCAGTGT ATCATGTGTG TCAACTATGT GCTGAGTGAG  1140

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ACGACGTGGT 


GTTCTCCATG 


GACC AG AC CG 


AATCCCTGTT 


CAAGCCACAC 


1200 


ptgatggppa 


TGAACAGCAT 


CTTTGACAGC 


AGTGACGATG 


TGGCTGTAAC 


TGAGAAGAGC 


1260 


AAPTAPHTnT 
rtJ-\.v- I. V_ X \J X 


TCACCAAACT 


GAAGGAGGAG 


CCCGAGGAAC 


TGGCCCAGTT 


GGCCCCCACC 


1320 


p p a an atg 


CCATTATTTC 


TCTCGATTTC 


GG AAG C C AG A 


ACTTCGATGA 


ACCCTCAGCC 


1380 


tatggpaagg 


CCATCCTTCC 


CCCGGGCCAG 


CCATGGGTCT 


CGGGG CTGAG 


GAGCCACAGT 


1440 


cccc a g 21 g p g 


AGTCCGGGAG 


CCTGCCAGCC 


TTCACTGTGC 


CCCAGGCAGA 


CACCCCAGGG 


1500 




PPAGTGCTTC 


AAG CAGCAGT 


AGCTGCTCCA 


CGCCCAGCAG 


CCCTGAGGAC 


1560 


TACTA 1 I LA x 


rrTTfiRAGAA 


TCCCTTGAAG 


ATCGAAGTGA 


TTGAGAAGCT 


TTTCG CCATG 


1620 




nGZiGnGACCC 

LVjAUwUn^ v- w 


GGGCAGTACC 


CAGACGGACT 


TCAGTGAACT 


GGATTTGGAG 


1680 


AC CTTGG C At- 


r* r"r a p a t p P c 


TATGGACGGC 


GAGGACTTCC 


AGCTGAGCCC 


CATCTGCCCA 


1740 


GAG GAG C CG C 


rpAl atGPP AGA 


GAGCCCCCAG 


CCCACCCCCC 


AGCACTGCTT 


CAGTACCATG 


1800 


ACCAGCATCT 


Tr"*/-* ft r* c*r*d PT 


P ACCC CGGGG 


GCCACCCACG 


GCCCCTTCTT 


CCTCGATAAG 


1860 


TACCCGCAGC 


Asj I 1 uuAHHVj 


P AGG AAG AC A 


GAGTCTGAGC 


ACTGGCCCAT 


GTCTTCCATC 


1920 


TTCTTTGATG 


A-»HTT" , /"* , AfT!r" , a A 
L. 1 (juoAULAn, 


AGGGTCCCTG 


TCTCCATGCT 


GTGGCCAGGC 


CAGCACCCCT 


1980 


CTCTCTTCTA 


rrr*r*r* A rrir* a f2 
I UubAbULHU 


ATCGAACACG 


CAGTGGCCCC 


CGGATCCACC 


ATTACATTTC 


2040 


GGCCCTACTA 


a /^rn^t/***/**^*T , f!* r r 


GGGTGATCAG 


AGTG CTG AAT 


CCCTGGGAGC 


CCTGCCGGTG 


2100 


GGGTCATGGC 


A /^"T^rp/^ A A /"^T^ 

AGTTGtjAAC I 




CCGCTTCATG 


TCTCCATGTT 


CAAGATGAGG 


2160 


TCTGCAAAGG 


ACTTCuwout 




TACATGATGA 


GCCCAGCCAT 


GATCGCCCTG 


2220 


TCCAACAAGC 


TG AAG CTAAft 




GAGTATGAGG 


AG C AAGCCTT 


CCAAGACACA 


2280 


AGCGGGGGGG 


AC CCTL. L.ALtVj 




TC AC AC TTG A 


TGTGGAAACG 


TATGAAGAGC 


2340 


CTCATGGGCG 


GGACCTG1LL 


1 x ivjniwvv^ x 


G AC AAG AC CA 


TCAGTGCGAA 


CATGGCCCCC 


2400 


GATGAATTCA 


An ai At it k k 7v n nn/^ 
CCCAAAAA 1 v_ 


TaTnAfSAGGf 1 


CTGGGCCAGC 


CACTGAGACA 


CCTGCCACCT 


2460 


CCCCAGCCAC 


C ATCT AC C ACj 




GAGAACGCCA 


AGACTGGGTT 


CCCGCCACAG 


2520 


TGCTATGCCT 


C C C AGTT C CA 


r*^ a rT a pnGT 

UuAL J- At-vjO X 


f PTC C AGG AG 


CTCAAAAGGT 


GTCAGGCGTG 


2580 


GCCAGTCGAC 


rrt At At At At At A 1 * 

TGCTGGGGCL 


a T , r , n r r r pr , G a g 


PPTTACCTGT 


TGCCGGAACT 


GACCAGATAT 


2640 


GACTGTGAGG 


TGAACGTGCC 


r*r*TT* r^r^'vnn a 
I 1 ubM. 


AGPTPPAPAC 


TCCTGCAGGG 


GAGAGACCTT 


2700 


CTCAGAGCTC 




PAGCTGAG C C 


AGGGCCTCTG 


GCCGGGCATG 


CCCCTGCCTG 


2760 


CCCCGCCGTC 


TTGACCTGCC 


AG CTT CACTT 


A"* A"» 1\ npA-irT^/-«rpA>irTi 


TGPTATTAGG 


TATCTCTAAC 


2820 


ACCAGCACAC 


TTCTTACGAG 


AT(STACTCAA 


CCTGGCCTAC 


TGGCCAGGTC 


ACCAAGCAGT 


2880 


GGCCTTTATC 


TGACATGCTC 


ACTTTATTAT 


CCATGTTTTA 


AAAATACATA 


GTTGTTGTAC 


2940 


CTGCTATGTT 


TTACCGTTGA 


TGAAAGTGTT 


CTGAAATTTT 


ATAAGATTTC 


CCCCTCCCTC 


3000 


CCTCCCTTGA 


ATTACTTCTA 


ATTTATATTC 


C 






3031 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

GCCCTACGTG CTGTCTCA  18

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 870 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Met Thr Ala Asp Lys Glu Lys Lys Arg Ser Ser Ser Glu Arg Arg Lys
1               5                   10                  15

Glu Lys Ser Arg Asp Ala Ala Arg Cys Arg Arg Ser Lys Glu Thr Glu
            20                  25                  30

Val Phe Tyr Glu Leu Ala His Glu Leu Pro Leu Pro His Ser Val Ser
        35                  40                  45

Ser His Leu Asp Lys Ala Ser Ile Met Arg Leu Glu Ile Ser Phe Leu
    50                  55                  60

Arg Thr His Lys Leu Leu Ser Ser Val Cys Ser Glu Asn Glu Ser Glu
65                  70                  75                  80

Ala Glu Ala Asp Gln Gln Met Asp Asn Leu Tyr Leu Lys Ala Leu Glu
                85                  90                  95

Gly Phe Ile Ala Val Val Thr Gln Asp Gly Asp Met Ile Phe Leu Ser
            100                 105                 110

Glu Asn Ile Ser Lys Phe Met Gly Leu Thr Gln Val Glu Leu Thr Gly
        115                 120                 125

His Ser Ile Phe Asp Phe Thr His Pro Cys Asp His Glu Glu Ile Arg
    130                 135                 140

Glu Asn Leu Ser Leu Lys Asn Gly Ser Gly Phe Gly Lys Lys Ser Lys
145                 150                 155                 160

Asp Met Ser Thr Glu Arg Asp Phe Phe Met Arg Met Lys Cys Thr Val
                165                 170                 175

Thr Asn Arg Gly Arg Thr Val Asn Leu Lys Ser Ala Thr Trp Lys Val
            180                 185                 190

Leu His Cys Thr Gly Gln Val Lys Val Tyr Asn Asn Cys Pro Pro His
        195                 200                 205

Asn Ser Leu Cys Gly Tyr Lys Glu Pro Leu Leu Ser Cys Leu Ile Ile
    210                 215                 220

Met Cys Glu Pro Ile Gln His Pro Ser His Met Asp Ile Pro Leu Asp
225                 230                 235                 240

Ser Lys Thr Phe Leu Ser Arg His Ser Met Asp Met Lys Phe Thr Tyr
                245                 250                 255

Cys Asp Asp Arg Ile Thr Glu Leu Ile Gly Tyr His Pro Glu Glu Leu
            260                 265                 270

Leu Gly Arg Ser Ala Tyr Glu Phe Tyr His Ala Leu Asp Ser Glu Asn
        275                 280                 285

Met Thr Lys Ser His Gln Asn Leu Cys Thr Lys Gly Gln Val Val Ser
    290                 295                 300

Gly Gln Tyr Arg Met Leu Ala Lys His Gly Gly Tyr Val Trp Leu Glu
305                 310                 315                 320

Thr Gln Gly Thr Val Ile Tyr Asn Pro Arg Asn Leu Gln Pro Gln Cys
                325                 330                 335

Ile Met Cys Val Asn Tyr Val Leu Ser Glu Ile Glu Lys Asn Asp Val
            340                 345                 350

Val Phe Ser Met Asp Gln Thr Glu Ser Leu Phe Lys Pro His Leu Met
        355                 360                 365

Ala Met Asn Ser Ile Phe Asp Ser Ser Gly Lys Gly Ala Val Ser Glu
    370                 375                 380

Lys Ser Asn Phe Leu Phe Thr Lys Leu Lys Glu Glu Pro Glu Glu Leu
385                 390                 395                 400

Ala Gln Leu Ala Pro Thr Pro Gly Asp Ala Ile Ile Ser Leu Asp Phe
                405                 410                 415

Gly Asn Gln Asn Phe Glu Glu Ser Ser Ala Tyr Gly Lys Ala Ile Leu
            420                 425                 430

Pro Pro Ser Gln Pro Trp Ala Thr Glu Leu Arg Ser His Ser Thr Gln
        435                 440                 445

Ser Glu Ala Gly Ser Leu Pro Ala Phe Thr Val Pro Gln Ala Ala Ala
    450                 455                 460

Pro Gly Ser Thr Thr Pro Ser Ala Thr Ser Ser Ser Ser Ser Cys Ser
465                 470                 475                 480

Thr Pro Asn Ser Pro Glu Asp Tyr Tyr Thr Ser Leu Asp Asn Asp Leu
                485                 490                 495

Lys Ile Glu Val Ile Glu Lys Leu Phe Ala Met Asp Thr Glu Ala Lys
            500                 505                 510

Asp Gln Cys Ser Thr Gln Thr Asp Phe Asn Glu Leu Asp Leu Glu Thr
        515                 520                 525

Leu Ala Pro Tyr Ile Pro Met Asp Gly Glu Asp Phe Gln Leu Ser Pro
    530                 535                 540

Ile Cys Pro Glu Glu Arg Leu Leu Ala Glu Asn Pro Gln Ser Thr Pro
545                 550                 555                 560

Gln His Cys Phe Ser Ala Met Thr Asn Ile Phe Gln Pro Leu Ala Pro
                565                 570                 575

Val Ala Pro His Ser Pro Phe Leu Leu Asp Lys Phe Gln Gln Gln Leu
            580                 585                 590

Glu Ser Lys Lys Thr Glu Pro Glu His Arg Pro Met Ser Ser Ile Phe
        595                 600                 605

Phe Asp Ala Gly Ser Lys Ala Ser Leu Pro Pro Cys Cys Gly Gln Ala
    610                 615                 620

Ser Thr Pro Leu Ser Ser Met Gly Gly Arg Ser Asn Thr Gln Trp Pro
625                 630                 635                 640

Pro Asp Pro Pro Leu His Phe Gly Pro Thr Lys Trp Ala Val Gly Asp
                645                 650                 655

Gln Arg Thr Glu Phe Leu Gly Ala Ala Pro Leu Gly Pro Pro Val Ser
            660                 665                 670

Pro Pro His Val Ser Thr Phe Lys Thr Arg Ser Ala Lys Gly Phe Gly
        675                 680                 685

Ala Arg Gly Pro Asp Val Leu Ser Pro Ala Met Val Ala Leu Ser Asn
    690                 695                 700

Lys Leu Lys Leu Lys Arg Gln Leu Glu Tyr Glu Glu Gln Ala Phe Gln
705                 710                 715                 720

Asp Leu Ser Gly Gly Asp Pro Pro Gly Gly Ser Thr Ser His Leu Met
                725                 730                 735

Trp Lys Arg Met Lys Asn Leu Arg Gly Gly Ser Cys Pro Leu Met Pro
            740                 745                 750

Asp Lys Pro Leu Ser Ala Asn Val Pro Asn Asp Lys Phe Thr Gln Asn
        755                 760                 765

Pro Met Arg Gly Leu Gly His Pro Leu Arg His Leu Pro Leu Pro Gln
    770                 775                 780

Pro Pro Ser Ala Ile Ser Pro Gly Glu Asn Ser Lys Ser Arg Phe Pro
785                 790                 795                 800

Pro Gln Cys Tyr Ala Thr Gln Tyr Gln Asp Tyr Ser Leu Ser Ser Ala
                805                 810                 815

His Lys Val Ser Gly Met Ala Ser Arg Leu Leu Gly Pro Ser Phe Glu
            820                 825                 830

Ser Tyr Leu Leu Pro Glu Leu Thr Arg Tyr Asp Cys Glu Val Asn Val
        835                 840                 845

Pro Val Leu Gly Ser Ser Thr Leu Leu Gln Gly Gly Asp Leu Leu Arg
    850                 855                 860

Ala Leu Asp Gln Ala Thr
865                 870

10 (2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 875 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Thr Ala Asp Lys Glu Lys Lys Arg Ser Ser Ser Glu Leu Arg Lys 

5 10 15 

Glu Lys ser Arg Asp Ala Ala Arg Cys Arg Arg Ser Lys Glu Thr Glu 

20 25 30 

Val Phe Tyr Glu Leu Ala His Glu Leu Pro Leu Pro His Ser Val Ser 

35 40 45 

Ser His Leu Asp Lys Ala Ser He Met Arg Leu Ala He Ser Phe Leu 



25 50 55 60 

Arg Thr His Lys Leu Leu Ser Ser Val Cys Ser Glu Asn Glu Ser Glu 

Ala Glu Ala Asp Gin Gin Met Asp Asn Leu Tyr Leu Lys Ala Leu Glu 
85 90 95 

30 Gly Phe He Ala val Val Thr Gin Asp Gly Asp Met lie Phe Leu Ser 

100 105 HO 

Glu Asn lie Ser Lys Phe Met Gly Leu Thr Gin Val Glu Leu Thr Gly 

115 120 125 

His Ser He Phe Asp Phe Thr His Pro Cys Asp His Glu Glu He Arg 
35 130 135 I 40 

Glu Asn Leu Thr Leu Lys . Asn Gly Ser Gly Phe Gly Lys Lys Ser Lys 
145 150 155 I" 
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Asp Val Ser Thr Glu Arg Asp Phe Phe Met Arg Met Lys Cys Thr Val 

165 170 175 

Thr Asn Arg Gly Arg Thr Val Asn Leu Lys Ser Ala Thr Trp Lys Ser 
180 185 190 



Va 



1 Leu His Cys Thr Gly Gin Val Arg Val Tyr Asn Asn Cys Pro Pro 
5 195 200 205 

His Ser Ser Leu Cys Gly Ser Lys Glu Pro Leu Leu Ser Cys Leu He 

215 220 

210 

He Met Cys Glu Pro He Gin His Pro Ser His Met Asp lie Pro Leu 

225 230 235 240 

10 Asp Ser Lys Thr Phe Leu Ser Arg His Ser Met Asp Met Lys Phe Thr 

245 250 255 

Tyr Cys Asp Asp Arg He Leu Glu Leu He Gly Tyr His Pro Glu Glu 

260 265 270 

Leu Leu Gly Arg Ser Ala Tyr Glu Phe Tyr His Ala Leu Asp Ser Glu 

15 275 * 280 285 

Asn Met Thr Lys Ser His Gin Asn Leu Cys Thr Lys Gly Gin Val Val 

290 295 300 

Ser Gly Gin Tyr Arg Met Leu Ala Lys His Gly Gly Tyr Val Trp Leu 

-»m 315 320 

305 310 

20 Glu Thr Gin Gly Thr Val He Tyr Asn Pro Arg Asn Leu Gin Pro Gin 

325 330 335 

Cys lie Met Cys Val Asn Tyr Val Leu Ser Glu He Glu Lys Asn Asp 

340 345 350 

Val val Phe Ser Met Asp Gin Thr Glu Ser Leu Phe Lys Pro His Leu 
25 355 360 365 

Met Ala Met Asn Ser Ile Phe Asp Ser Ser Asp Asp Val Ala Val Thr
370 375 380

Glu Lys Ser Asn Tyr Leu Phe Thr Lys Leu Lys Glu Glu Pro Glu Glu
385 390 395 400

Leu Ala Gln Leu Ala Pro Thr Pro Gly Asp Ala Ile Ile Ser Leu Asp
405 410 415

Phe Gly Ser Gln Asn Phe Asp Glu Pro Ser Ala Tyr Gly Lys Ala Ile
420 425 430

Leu Pro Pro Gly Gln Pro Trp Val Ser Gly Leu Arg Ser His Ser Ala
435 440 445

Gln Ser Glu Ser Gly Ser Leu Pro Ala Phe Thr Val Pro Gln Ala Asp
450 455 460

Thr Pro Gly Asn Thr Thr Pro Ser Ala Ser Ser Ser Ser Ser Cys Ser
465 470 475 480

Thr Pro Ser Ser Pro Glu Asp Tyr Tyr Ser Ser Leu Glu Asn Pro Leu
485 490 495

Lys Ile Glu Val Ile Glu Lys Leu Phe Ala Met Asp Thr Glu Pro Arg
500 505 510

Asp Pro Gly Ser Thr Gln Thr Asp Phe Ser Glu Leu Asp Leu Glu Thr
515 520 525

Leu Ala Pro Tyr Ile Pro Met Asp Gly Glu Asp Phe Gln Leu Ser Pro
530 535 540

Ile Cys Pro Glu Glu Pro Leu Met Pro Glu Ser Pro Gln Pro Thr Pro
545 550 555 560

Gln His Cys Phe Ser Thr Met Thr Ser Ile Phe Gln Pro Leu Thr Pro
565 570 575

Gly Ala Thr His Gly Pro Phe Phe Leu Asp Lys Tyr Pro Gln Gln Leu
580 585 590

Glu Ser Arg Lys Thr Glu Ser Glu His Trp Pro Met Ser Ser Ile Phe
595 600 605

Phe Asp Ala Gly Ser Lys Gly Ser Leu Ser Pro Cys Cys Gly Gln Ala
610 615 620

Ser Thr Pro Leu Ser Ser Met Gly Gly Arg Ser Asn Thr Gln Trp Pro
625 630 635 640

Pro Asp Pro Pro Leu His Phe Gly Pro Thr Lys Trp Pro Val Gly Asp
645 650 655

Gln Ser Ala Glu Ser Leu Gly Ala Leu Pro Val Gly Ser Trp Gln Leu
660 665 670

Glu Leu Pro Ser Ala Pro Leu His Val Ser Met Phe Lys Met Arg Ser
675 680 685

Ala Lys Asp Phe Gly Ala Arg Gly Pro Tyr Met Met Ser Pro Ala Met
690 695 700

Ile Ala Leu Ser Asn Lys Leu Lys Leu Lys Arg Gln Leu Glu Tyr Glu
705 710 715 720

Glu Gln Ala Phe Gln Asp Thr Ser Gly Gly Asp Pro Pro Gly Thr Ser
725 730 735

Ser Ser His Leu Met Trp Lys Arg Met Lys Ser Leu Met Gly Gly Thr
740 745 750

Cys Pro Leu Met Pro Asp Lys Thr Ile Ser Ala Asn Met Ala Pro Asp
755 760 765

Glu Phe Thr Gln Lys Ser Met Arg Gly Leu Gly Gln Pro Leu Arg His
770 775 780

Leu Pro Pro Pro Gln Pro Pro Ser Thr Arg Ser Ser Gly Glu Asn Ala
785 790 795 800

Lys Thr Gly Phe Pro Pro Gln Cys Tyr Ala Ser Gln Phe Gln Asp Tyr
805 810 815

Gly Pro Pro Gly Ala Gln Lys Val Ser Gly Val Ala Ser Arg Leu Leu
820 825 830

Gly Pro Ser Phe Glu Pro Tyr Leu Leu Pro Glu Leu Thr Arg Tyr Asp
835 840 845

Cys Glu Val Asn Val Pro Val Pro Gly Ser Ser Thr Leu Leu Gln Gly
850 855 860

Arg Asp Leu Leu Arg Ala Leu Asp Gln Ala Thr
865 870 875

(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 826 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

Met Glu Gly Ala Gly Gly Ala Asn Asp Lys Lys Lys Ile Ser Ser Glu
1 5 10 15

Arg Arg Lys Glu Lys Ser Arg Asp Ala Ala Arg Ser Arg Arg Ser Lys
20 25 30

Glu Ser Glu Val Phe Tyr Glu Leu Ala His Gln Leu Pro Leu Pro His
35 40 45

Asn Val Ser Ser His Leu Asp Lys Ala Ser Val Met Arg Leu Thr Ile
50 55 60

Ser Tyr Leu Arg Val Arg Lys Leu Leu Asp Ala Gly Asp Leu Asp Ile
65 70 75 80

Glu Asp Asp Met Lys Ala Gln Met Asn Cys Phe Tyr Leu Lys Ala Leu
85 90 95

Asp Gly Phe Val Met Val Leu Thr Asp Asp Gly Asp Met Ile Tyr Ile
100 105 110

Ser Asp Asn Val Asn Lys Tyr Met Gly Leu Thr Gln Phe Glu Leu Thr
115 120 125

Gly His Ser Val Phe Asp Phe Thr His Pro Cys Asp His Glu Glu Met
130 135 140

Arg Glu Met Leu Thr His Arg Asn Gly Leu Val Lys Lys Gly Lys Glu
145 150 155 160

Gln Asn Thr Gln Arg Ser Phe Phe Leu Arg Met Lys Cys Thr Leu Thr
165 170 175

Ser Arg Gly Arg Thr Met Asn Ile Lys Ser Ala Thr Trp Lys Val Leu
180 185 190

His Cys Thr Gly His Ile His Val Tyr Asp Thr Asn Ser Asn Gln Pro
195 200 205

Gln Cys Gly Tyr Lys Lys Pro Pro Met Thr Cys Leu Val Leu Ile Cys
210 215 220

Glu Pro Ile Pro His Pro Ser Asn Ile Glu Ile Pro Leu Asp Ser Lys
225 230 235 240

Thr Phe Leu Ser Arg His Ser Leu Asp Met Lys Phe Ser Tyr Cys Asp
245 250 255

Glu Arg Ile Thr Glu Leu Met Gly Tyr Glu Pro Glu Glu Leu Leu Gly
260 265 270

Arg Ser Ile Tyr Glu Tyr Tyr His Ala Leu Asp Ser Asp His Leu Thr
275 280 285

Lys Thr His His Asp Met Phe Thr Lys Gly Gln Val Thr Thr Gly Gln
290 295 300

Tyr Arg Met Leu Ala Lys Arg Gly Gly Tyr Val Trp Val Glu Thr Gln
305 310 315 320

Ala Thr Val Ile Tyr Asn Thr Lys Asn Ser Gln Pro Gln Cys Ile Val
325 330 335

Cys Val Asn Tyr Val Val Ser Gly Ile Ile Gln His Asp Leu Ile Phe
340 345 350

Ser Leu Gln Gln Thr Glu Cys Val Leu Lys Pro Val Glu Ser Ser Asp
355 360 365

Met Lys Met Thr Gln Leu Phe Thr Lys Val Glu Ser Glu Asp Thr Ser
370 375 380

Ser Leu Phe Asp Lys Leu Lys Lys Glu Pro Asp Ala Leu Thr Leu Leu
385 390 395 400

Ala Pro Ala Ala Gly Asp Thr Ile Ile Ser Leu Asp Phe Gly Ser Asn
405 410 415

Asp Thr Glu Thr Asp Asp Gln Gln Leu Glu Glu Val Pro Leu Tyr Asn
420 425 430

Asp Val Met Leu Pro Ser Pro Asn Glu Lys Leu Gln Asn Ile Asn Leu
435 440 445

Ala Met Ser Pro Leu Pro Thr Ala Glu Thr Pro Lys Pro Leu Arg Ser
450 455 460

Ser Ala Asp Pro Ala Leu Asn Gln Glu Val Ala Leu Lys Leu Glu Pro
465 470 475 480

Asn Pro Glu Ser Leu Glu Leu Ser Phe Thr Met Pro Gln Ile Gln Asp
485 490 495

Gln Thr Pro Ser Pro Ser Asp Gly Ser Thr Arg Gln Ser Ser Pro Glu
500 505 510

Pro Asn Ser Pro Ser Glu Tyr Cys Phe Tyr Val Asp Ser Asp Met Val
515 520 525

Asn Glu Phe Lys Leu Glu Leu Val Glu Lys Leu Phe Ala Glu Asp Thr
530 535 540

Glu Ala Lys Asn Pro Phe Ser Thr Gln Asp Thr Asp Leu Asp Leu Glu
545 550 555 560

Met Leu Ala Pro Tyr Ile Pro Met Asp Asp Asp Phe Gln Leu Arg Ser
565 570 575

Phe Asp Gln Leu Ser Pro Leu Glu Ser Ser Ser Ala Ser Pro Glu Ser
580 585 590

Ala Ser Pro Gln Ser Thr Val Thr Val Phe Gln Gln Thr Gln Ile Gln
595 600 605

Glu Pro Thr Ala Asn Ala Thr Thr Thr Thr Ala Thr Thr Asp Glu Leu
610 615 620

Lys Thr Val Thr Lys Asp Arg Met Glu Asp Ile Lys Ile Leu Ile Ala
625 630 635 640

Ser Pro Ser Pro Thr His Ile His Lys Glu Thr Thr Ser Ala Thr Ser
645 650 655

Ser Pro Tyr Arg Asp Thr Gln Ser Arg Thr Ala Ser Pro Asn Arg Ala
660 665 670

Gly Lys Gly Val Ile Glu Gln Thr Glu Lys Ser His Pro Arg Ser Pro
675 680 685

Asn Val Leu Ser Val Ala Leu Ser Gln Arg Thr Thr Val Pro Glu Glu
690 695 700

Glu Leu Asn Pro Lys Ile Leu Ala Leu Gln Asn Ala Gln Arg Lys Arg
705 710 715 720

Lys Met Glu His Asp Gly Ser Leu Phe Gln Ala Val Gly Ile Gly Thr
725 730 735

Leu Leu Gln Gln Pro Asp Asp His Ala Ala Thr Thr Ser Leu Ser Trp
740 745 750

Lys Arg Val Lys Gly Cys Lys Ser Ser Glu Gln Asn Gly Met Glu Gln
755 760 765

Lys Thr Ile Ile Leu Ile Pro Ser Asp Leu Ala Cys Arg Leu Leu Gly
770 775 780

Gln Ser Met Asp Glu Ser Gly Leu Pro Gln Leu Thr Ser Tyr Asp Cys
785 790 795 800

Glu Val Asn Ala Pro Ile Gln Gly Ser Arg Asn Leu Leu Gln Gly Glu
805 810 815

Glu Leu Leu Arg Ala Leu Asp Gln Val Asn
820 825

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 810 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

Met Ser Ser Glu Arg Arg Lys Glu Lys Ser Arg Asp Ala Ala Arg Ser
1 5 10 15

Arg Arg Thr Lys Glu Ser Glu Val Phe Tyr Glu Leu Ala His Gln Leu
20 25 30

Pro Leu Pro His Asn Val Ser Ser His Leu Asp Lys Ala Ser Val Met
35 40 45

Arg Leu Thr Ile Ser Tyr Leu Arg Val Arg Lys Leu Leu Asp Ala Gly
50 55 60

Gly Leu Asp Ser Glu Asp Glu Met Lys Ala Gln Met Asp Cys Phe Tyr
65 70 75 80

Leu Lys Ala Leu Asp Gly Phe Val Met Val Leu Thr Asp Asp Gly Asp
85 90 95

Met Val Tyr Ile Ser Asp Asn Val Asn Lys Tyr Met Gly Leu Thr Gln
100 105 110

Phe Glu Leu Ala Gly His Ser Val Phe Asp Phe Thr His Pro Cys Asp
115 120 125

His Glu Glu Met Arg Glu Met Leu Thr His Arg Asn Gly Pro Val Arg
130 135 140

Lys Gly Lys Glu Leu Asn Thr Gln Arg Ser Phe Phe Leu Arg Met Lys
145 150 155 160

Cys Thr Leu Thr Ser Arg Gly Arg Thr Met Asn Ile Lys Ser Ala Thr
165 170 175

Trp Lys Val Leu His Cys Thr Gly His Ile His Val Tyr Asp Thr Asn
180 185 190

Ser Asn Gln Pro Gln Cys Gly Tyr Lys Lys Pro Pro Met Thr Cys Leu
195 200 205

Val Leu Ile Cys Glu Pro Ile Pro His Pro Ser Asn Ile Glu Ile Pro
210 215 220

Leu Asp Ser Lys Thr Phe Leu Ser Arg His Ser Leu Asp Met Lys Phe
225 230 235 240

Ser Tyr Cys Asp Glu Arg Ile Thr Glu Leu Met Gly Tyr Glu Pro Glu
245 250 255

Glu Leu Leu Gly Arg Ser Ile Tyr Glu Tyr Tyr His Ala Leu Asp Ser
260 265 270

Asp His Leu Thr Lys Thr His His Asp Met Phe Thr Lys Gly Gln Val
275 280 285

Thr Thr Gly Gln Tyr Arg Met Leu Ala Lys Arg Gly Gly Tyr Val Trp
290 295 300

Val Glu Thr Gln Ala Thr Val Ile Tyr Asn Thr Lys Asn Ser Gln Pro
305 310 315 320

Gln Cys Ile Val Cys Val Asn Tyr Val Val Ser Gly Ile Ile Gln His
325 330 335

Asp Leu Ile Phe Ser Leu Gln Gln Thr Glu Ser Val Leu Lys Pro Val
340 345 350

Glu Ser Ser Asp Met Lys Met Thr Gln Leu Phe Thr Lys Val Glu Ser
355 360 365

Glu Asp Thr Ser Cys Leu Phe Asp Lys Leu Lys Lys Glu Pro Asp Ala
370 375 380

Leu Thr Leu Leu Ala Pro Ala Ala Gly Asp Thr Ile Ile Ser Leu Asp
385 390 395 400

Phe Gly Ser Asp Asp Thr Glu Thr Glu Asp Gln Gln Leu Glu Asp Val
405 410 415

Pro Leu Tyr Asn Asp Val Met Phe Pro Ser Ser Asn Glu Lys Leu Asn
420 425 430

Ile Asn Leu Ala Met Ser Pro Leu Pro Ser Ser Glu Thr Pro Lys Pro
435 440 445

Leu Arg Ser Ser Ala Asp Pro Ala Leu Asn Gln Glu Val Ala Leu Lys
450 455 460

Leu Glu Ser Ser Pro Glu Ser Leu Gly Leu Ser Phe Thr Met Pro Gln
465 470 475 480

Ile Gln Asp Gln Pro Ala Ser Pro Ser Asp Gly Ser Thr Arg Gln Ser
485 490 495

Ser Pro Glu Pro Asn Ser Pro Ser Glu Tyr Cys Phe Asp Val Asp Ser
500 505 510

Asp Met Val Asn Val Phe Lys Leu Glu Leu Val Glu Lys Leu Phe Ala
515 520 525

Glu Asp Thr Glu Ala Lys Asn Pro Phe Ser Thr Gln Asp Thr Asp Leu
530 535 540

Asp Leu Glu Met Leu Ala Pro Tyr Ile Pro Met Asp Asp Asp Phe Gln
545 550 555 560

Leu Arg Ser Phe Asp Gln Leu Ser Pro Leu Glu Ser Asn Ser Pro Ser
565 570 575

Pro Pro Ser Met Ser Thr Val Thr Gly Phe Gln Gln Thr Gln Leu Gln
580 585 590

Lys Pro Thr Ile Thr Ala Thr Ala Thr Thr Thr Ala Thr Thr Asp Glu
595 600 605

Ser Lys Thr Glu Thr Lys Asp Asn Lys Glu Asp Ile Lys Ile Leu Ile
610 615 620

Ala Ser Pro Ser Ser Thr Gln Val Pro Gln Glu Thr Thr Thr Ala Lys
625 630 635 640

Ala Ser Ala Tyr Ser Gly Thr His Ser Arg Thr Ala Ser Pro Asp Arg
645 650 655

Ala Gly Lys Arg Val Ile Glu Gln Thr Asp Lys Ala His Pro Arg Ser
660 665 670

Leu Asn Leu Ser Ala Thr Leu Asn Gln Arg Asn Thr Val Pro Glu Glu
675 680 685

Glu Leu Asn Pro Lys Thr Ile Ala Ser Gln Asn Ala Gln Arg Lys Arg
690 695 700

Lys Met Glu His Asp Gly Ser Leu Phe Gln Ala Ala Gly Ile Gly Thr
705 710 715 720

Leu Leu Gln Gln Pro Gly Asp Cys Ala Pro Thr Met Ser Leu Ser Trp
725 730 735

Lys Arg Val Lys Gly Phe Ile Ser Ser Glu Gln Asn Gly Thr Glu Gln
740 745 750

Lys Thr Ile Ile Leu Ile Pro Ser Asp Leu Ala Cys Arg Leu Leu Gly
755 760 765

Gln Ser Met Asp Val Ser Gly Leu Pro Gln Leu Thr Ser Tyr Asp Cys
770 775 780

Glu Val Asn Ala Pro Ile Gln Gly Ser Arg Asn Leu Leu Gln Gly Glu
785 790 795 800

Glu Leu Leu Arg Ala Leu Asp Gln Val Asn
805 810
WHAT IS CLAIMED IS:

1. An isolated protein comprising an endothelial PAS domain protein 1 (EPAS1) protein
(SEQ ID NO: 4 or 5), or an EPAS1 protein domain thereof having at least 14 consecutive amino
acids of SEQ ID NO: 4 or 5 and an EPAS1-specific activity.

2. An isolated protein according to claim 1, wherein said protein specifically binds at least
one of a bHLH/PAS protein, a heat shock protein, or a nucleic acid consisting of SEQ ID NO: 3.

3. A recombinant nucleic acid encoding a protein according to claim 1.

4. A cell comprising a nucleic acid according to claim 3.

5. A method of making an isolated EPAS1 protein, comprising steps: introducing a nucleic
acid according to claim 3 into a host cell or cellular extract, incubating said host cell or extract
under conditions whereby said nucleic acid is expressed as a transcript and said transcript is
expressed as a translation product comprising said protein, and isolating said translation product.

6. An isolated EPAS1 protein made by the method of claim 5. 

7. An isolated EPAS1 nucleic acid comprising SEQ ID NO: 1 or 2, or a fragment thereof
having at least 24 consecutive bases of SEQ ID NO: 1 or 2 and sufficient to specifically hybridize
with a nucleic acid having the sequence defined by the corresponding SEQ ID NO: 1 or 2 in the
presence of human or murine genomic DNA, respectively.

8. An isolated EPAS1 nucleic acid according to claim 7, said nucleic acid comprising SEQ
ID NO:1, or a fragment thereof having at least 24 consecutive bases of SEQ ID NO:1 and
sufficient to specifically hybridize with a nucleic acid having the sequence defined by SEQ ID
NO:1 in the presence of human genomic DNA.

9. An isolated EPAS1 nucleic acid according to claim 7, said nucleic acid comprising SEQ
ID NO:2, or a fragment thereof having at least 24 consecutive bases of SEQ ID NO:2 and
sufficient to specifically hybridize with a nucleic acid having the sequence defined by SEQ ID
NO:2 in the presence of murine genomic DNA.

10. A method of screening for an agent which modulates the binding of an EPAS1 protein to
a binding target, said method comprising the steps of:
incubating a mixture comprising:
an isolated protein according to claim 1,
a binding target of said protein, and
a candidate agent;
under conditions whereby, but for the presence of said agent, said protein specifically
binds said binding target at a reference affinity;
detecting the binding affinity of said protein to said binding target to determine an
agent-biased affinity,
wherein a difference between the agent-biased affinity and the reference affinity indicates
that said agent modulates the binding of said protein to said binding target.

11. A method according to claim 10, wherein said binding target is one of a bHLH/PAS
protein, a heat shock protein, or a nucleic acid consisting of SEQ ID NO:3.